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Abstract. A solution for Smale's 17th problem, for the case of systems with bounded 
degree was recently given. This solution, an algorithm computing approximate zeros 
of complex polynomial systems in average polynomial time, assumed infinite precision. 
In this paper we describe a finite-precision version of this algorithm. Our main result 
shows that this version works within the same time bounds and requires a precision 
which, on the average, amounts to a polynomial amount of bits in the mantissa of the 
intervening floating-point numbers. 
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1 Introduction 

The 17th of the problems for the 21st century posed by Steve Smale [12] asks for an algorithm 
computing an approximate zero of a polynomial system in average polynomial time. 

The problem had occupied Smale during the 1990's and led him, together with Mike 
Shub, to a series of papers [13l [141 [13 [13 [IB [H] — known as the Bezout series — where 
a number of ideas and results approaching a solution for the 17th problem were proposed. 
These ideas arc at the core of all further research done on Smale's problem. 

Paramount within this research is the work of Carlos Beltran and Luis Miguel Pardo [5J 
[21 [3| who provided a randomized algorithm computing the desired approximate zero in 
average expected polynomial time. Here the word "average" refers to expectation over the 
input data and the word "expected" to expectation over the random choices made by the 
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algorithrr0. One can say that they gave a probabihstic solution to Smale's 17th problem. 
Further results, including a deterministic algorithm working in average time _/V'^('°siog^') 
— referred to as "nearly polynomial" — are given in [B]. This deterministic algorithm, when 
restricted to systems with bounded (or even moderately growing) degree, becomes an average 
polynomial-time algorithm, refered to in as MD. 

All the mentioned work (as well as all the work on Smale's 17th problem not mentioned 
above) assume infinite precision. As Beltran and Pardo put it in [4l p. 6] 

With the assumption of exact arithmetic [. . . ] the homotopy method [. . . ] is 
guaranteed to produce an approximate zero of /. 

This statement begs the question, what can one do if (as it happens with digital computers) 
only finite precision is availablqj? The goal of the present paper is to give an answer for 
systems of moderate degree. 

The distinctive feature of a finite precision algorithm is the presence of a real number 
u G (0,1), called round-off unit, with the property that all occurring numbers x in the 
computation are replaced by a number r„(a;) (the rounding of x) such that |a; — r„(a;)| < u\x\ 
(a more detailed account on finite-precision computations is in tj3.1|) . Algorithms where u 
remains fixed through the computation are said to have fixed precision. Otherwise, they 
have variable precision. In this paper, we describe and analyze finite-precision versions for 
both settings — we denote them by MDFix and MDVar, respectively — of algorithm MD. The 
rationale for the consideration of both settings will be made clear soon enough. 

Our results are of a probabilistic nature (often refered to as an "average-case analysis" ) . 
They therefore require a probability measure on the space of data. We next describe the 
measures we use. 

Let H(d) denote the linear space of complex polynomial systems / = (/i, . . . , /„) with 
homogeneous of degree di in n-l- 1 variables. Given a round-off unit u, each system / g ^(d) 
is rounded to a system r^{f). The data space corresponding to this precision is therefore 

H«:= {tuU) I / eH(d)}. 

Clearly we have a surjective function rj^ : 'H(d) T^u. The measure we endow T-Lu with 
is the push-out Vu of the standard Gaussian ^ in H(d)- That is, we endow 'H(d) with a 
standard Gaussian distribution (with respect to the Bombieri-Weyl basis in 'H(d), see ^2.1\ 
for details) and we define, for all Borelian subset A C "Hit, Vu{A) :— ii{r^^{A)). With the 
measure at hand wc can state our first main result. 

Theorem A. There exists a fixed precision algorithm MDFix satisfying the following. For 
a random input f G "Hu, algorithm MDFix returns an approximate zero of f with probability 
at least 

V log(l/^) ) ■ 

^Although technically similar, there is a remarkable difference between the probability distributions 
considered. The one for the input data is, explicitly or otherwise, claiming some closeness to the distribution 
of data "in practice." The only requirement for the distribution for the random choices of the algorithm is, 
in contrast, that it will be efficiently computable. An undisputed merit of the work of Beltran and Pardo is 
to come up with one distribution which is so and, at the same time, allows one to derive complexity bounds. 

^Incidentally, finite precision analysis for algorithms dealing with multivariate polynomial systems was 
pioneered by Steve Smale, dragging on the way one of the authors of the present paper, in [8]. 
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Otherwise, MDFix returns a failure message. The number of arithmetic operations performed 
is bounded as 



While the consideration of fixed precision is a rcahstic approach, a resuh such as Theo- 
rem A is not without shortcomings. Note that MDFix does not always return an approximate 
zero of its input /, and it fails to do so with positive probability. And, unfortunately, the 
lower bound for the probability of success shown in Theorem A becomes meaningless when 
N grows. In other words, a fixed precision u puts limits on the size of the systems for which 
one can expect MDFix to succeed. 

The relationship between input size, round-off unit, and probability of success implicit 
in Theorem A can be better expressed with the use of variable precision and the possibility 
of continuously adjusting the input reading to the current round-off unit. This is a less 
realistic setting but it pays off in terms of understanding. We next see how. 

We assume variable precision. That is, algorithms now start with an initial round-off 
unit and they have the capability to refine this parameter as the execution proceeds. In 
this context we will be interested in the smallest value attained by the round-off unit 
during the execution. A bound on amounts to a bound on the number of bits (or digits) 
required to store floating point approximations of the complex numbers occurring during 
the computation and, in this sense, is related to the bit-cost of performing the computation. 
In fact, the maximum number of bits we will need for each such approximation is essentially 
|logw*|. 

An issue naturally raised by the assumption of variable precision is the space of data from 
which algorithms will take their inputs. The spaces are appropriate for the fixed precision 
setting: systems / are read with the precision used throughout the computation. They 
do not appear to be so for the variable precision context. But finite-precision algorithms 
cannot take inputs with infinite-precision entries. An elegant solution for this situation is 
the consideration of black-boxes. These are theoretical devices which, as we said, are not 
realistic. But they do allow for the statement of results highlighting in a clear manner the 
relationship between precision needed and input size. 

In what follows, to every / g ^(d) we associate a routine read_input/ such that 
read_input / ( ) returns an approximation of / with the current round-off unit u. It is 
this routine what is given as input to our variable precision algorithm MDVar. Because of 
the bijection between this set of routines and the space 'H^d) may (and will) abuse lan- 
guage and take H(d) as the space of data for MDVar. In particular, we will endow this space 
of data with the standard Gaussian mentioned above, which we will denote by A^(0,ld). 

Our second main result is the following. 

Theorem B. There exists a variable precision algorithm MDVar satisfying the following. 
When f is randomly chosen from N{0., Id), algorithm MDVar on input f stops almost surely, 
and when it does so, returns an approximate zero of f . The number of arithmetic operations 
costMDVar(/) of MDVar on input f is bounded on the average as 



\nVD{\ogN + D + 72'^)uJ ' 
Here N = dime ^(d) denotes the size of the input systems and D := maxjdi, . . . 




dn}- 



E COStMDVar(./) = 0{D'- 



'3iV2(n + 1)^+1). 



/~JV(0,Id) 
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Furthermore, the finest precision u^(f) used by MDVar on input f is bounded on the average 
as 

E \og\u,if)\^OiD^N{n + lf+^) 

/~Af(0,Id) 

and as a consequence, when D is bounded, the bit-cost of MDVar is, on the average, polyno- 
mial in the size N of the input. 

Before proceeding with the technicalities, a couple of remarks are in place. 

(1) As mentioned above, we confirm that Theorem B adds understanding to the situa- 
tion depicted in Theorem A. Indeed, in the variable precision context the algorithm returns 
an approximate zero almost surely (that is, the probability of failure is now zero). Further- 
more, the complexity (understood as number of arithmetic operations performed) of MDVar 
remains essentially the same as that of MD. Finally, the relationship between precision and 
input size is made clear: we exhibit polynomial bounds (in the input size N) for the expected 
(over random input systems /) number of bits necessary to carry out the computation. 

(2) Both Theorems A and B show only the existence of algorithms MDFix and MDVar, 
respectively. We do not fully exhibit these algorithms in this paper. For such a complete 
description we should provide the exact values of some constants occurring within the 'big 
Oh' notation in a few intermediate results in our development. This is certainly possible 
but we believe that doing so would only degrade our exposition. Because, on the one hand, 
it would bring undue focus on a marginal issue and, on the other hand, the length of the 
exposition would unavoidably increase in a non-trivial way. We follow in this sense a well 
established tradition in finite-precision analyses, by which the goal is the understanding of 
the magnitude of the precision needed by an algorithm. In particular, we do not intend 
algorithms MDFix and MDVar to be actually implemented. They are simply vehicles to 
understand the behavior of the (already implemented) algorithm MD and in this sense we 
make ours the words of Wilkinson quoted by Higham to open the tenth chapter of [10] : 

All too often, too much attention is paid to the precise error bound that has been 
estabhshed. The main purpose of such an analysis is either to establish the essential 
numerical stability of an algorithm or to show why it is unstable and in doing so to 
expose what sort of change is necessary to make it stable. The precise error bound is 
not of great importance. 

2 Preliminaries 

2.1 Setting and Notation 

For d E N wc denote by Hd the subspace of C[Xo, . . . , X„] of homogeneous polynomials of 
degree d. For / G Hd wc write 

where a = {ag, . . . , a„) is assumed to range over all multi- indices such that |a| = Y^^=o '^k = 
d, (^) denotes the multinomial coefficient, and X" := Xq"X"^ ■ ■ ■ X^'\ That is, we take for 

1/2 

basis of the linear space Hd the Bombieri-Weyl basis consisting of the monomials (^) X". 
A reason to do so is that the Hermitian inner product associated to this basis is unitarily 
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invariant. That is, if g G Hd is given by g{x) = (^) ^q^", then the canonical 
Hcrmitian inner product 

(/, 9) = ^ aa ba 

satisfies, for all elements v in the unitary group U{n + 1), that 

if, 9) = {f °>^,9°v). 

Fix di, . . . , c?„ e N \ {0} and let 'H(d) = Hd^ x . . . x Hd„ be the vector space of polynomial 
systems / = (/i, . . . , /„) with fi £ C[Xo, . . . , Xn] homogeneous of degree di. The space 
?^(d) is naturally endowed with a Hermitian inner product (/, p) = J2'i=iifi^ 9i) ■ denote 
by 11/11 the corresponding norm of / S 'H(d)- 

We let N := dimc'H(d), D :— maxj di, and V :— Yl- di- Also, in the rest of this paper, 
we assume di > 2 for all i < ^ (linear equations can be easily eliminated). In particular, 
D>2. 

Let P" := P(C"+^) denote the complex projective space associated to C"+^ and S'('H(d)) 
the unit sphere of 'H(d)- These arc smooth manifolds that naturally carry the structure of a 
Riemannian manifold (for P" the metric is called Fubini-Study metric). We will denote by 
dp and dg their Riemannian distances which, in both cases, amount to the angle between 
the arguments. Specifically, for x,y S P" one has 



Occasionally, for f,g £ T~L{A) \ {0}, we will abuse language and write ds{f,g) to denote this 

11/11 

Vr {(/, C) e H(d) X P" I / ^ and /(C) = 0}. 



angle, that is, the distance c?s(j^, -Jlj'j = (is(/, g). We define the solution variety to be 



This is a smooth submanifold of 'H(d) x P" and hence also carries a Riemannian structure. 
We denote by Vr{f) the zero set of / G H(d) in P". 

By Bezout's Theorem, Vp(/) contains V points for almost all /. Let Df{Q\T^ denote the 
restriction of the derivative of /: C"+^ ^ C" at C to the tangent space := {v E C"+^ | 
(f, C) = 0} of P" at C- The subvariety of ill-posed pairs is defined as 

E'p :={(/,C)el4|rankA/(C)|T, <n}. 

Note that (/, C) ^ Sp means that C is a simple zero of /. In this case, by the implicit function 
theorem, the projection Vr — >■ H(d), (<?,a;) 1— > g can be locally inverted around (/, C)- The 
image E of Ep under the projection Vp — > 'H{d} is called the discriminant variety. 

2.2 Approximate Zeros, Complexity and Data Distribution 

In [11], Mike Shub introduced the following projective version of Newton's method. We 
associate to / £ ^(d) (with Df{x) of rank n for some x) a map N f : C"+^\{0} C"+^\{0} 
defined (almost everywhere) by 

Nj{x) = x~Df{x)^^J{x). 
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Note that Nf{x) is homogeneous of degree in / and of degree 1 in a: so that Nf induces 
a rational map from P" to P" (which we will still denote by Nf) and this map is invariant 
under multiplication of / by constants. 

We note that Nf{x) can be computed from / and x very efficiently: since the Jacobian 
Df{x) can be evaluated with 0{N) arithmetic operations [T], one can do with a total of 
0{N + n^) — 0{N) arithmetic operations, the equality since di> 2 implies = n{n^). 

It is well-known that when x is sufficiently close to a simple zero C of f, the sequence 
of Newton iterates beginning at x will converge quadratically fast to C- This property led 
Steve Smale to define the following intrinsic notion of approximate zero. 

Definition 2.1. By an approximate zero of / e 'H(d) associated with a zero C e P" of / we 
understand a point x e P" such that the sequence of Newton iterates (adapted to projective 
space) 

Xi+i Nf{xi) 

with initial point xq :— x converges immediately quadratically to i.e., 

dr{xi,C) < y-^) dr{xo,0 

for all i € N. 

It is this notion of approximation that is referred to in the statement of Smale's 17th 
problem. 

The last notion necessary to formally state Smale's problem is that of 'average cost'. 
For the cost of a computation Smale proposes the number of arithmetic operations (this 
includes comparisons and possibly square roots) performed during the computation. In 
the case of a finite-precision algorithm one needs to multiply this number by the largest 
number of bits (or digits) necessary to approximate the complex numbers occurring during 
the computation. 

The word 'average' refers to the standard normal distribution for the data (input) system 
/ G 'H(d). Recall, we express an element / G 'H(d) as a linear combination of the monomials 
in the Bombieri-Weyl basis. The standard normal distribution corresponds to choosing the 
coefficients in this combination independently and identically distributed from the centered 
Gaussian distribution on C (which in turn amounts to draw real and imaginary parts inde- 
pendently from the centered Gaussian distribution on M). We denote this distribution on 
•H(d) by iV(0,Id). 

Hence, if cost(/) denotes the cost of computing an approximate zero for / with a given 
algorithm then the average cost of this algorithm, for inputs in 'H(d), is given by the expected 
value 

E cost(/). 

/~W(0,Id) 

We remark that if the cost is homogeneous of degree zero, that is, if cost(/) = cost(A/) for 
all A 7^ 0, then the expectation above is the same as the expectation with / drawn from the 
uniform distribution on the unit sphere S'(H(d)). 

Smale's 17th problem asks for an algorithm computing an approximate zero (in the 
sense of Definition 12. ip with average cost (for the cost and data distribution described 
above) bounded by A^'-''^^ 
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2.3 Condition Numbers 



How close need x to be from C to be an approximate zero? This depends on how well 
conditioned the zero C, is. 

For / e ^(d) and x G C"+^\{0} we define the (normalized) condition number /^norm(/; x) 

by 

,(/, x) := 11/11 II (A/(2:)|tJ ' diag (^■^,\\x\f--\ 



where the right-hand side norm denotes the spectral norm and diag(ai) denotes the diagonal 
matrix with entries a^. Note that ^norm(/, is homogeneous of degree in both arguments, 
hence it is well defined for (/, x) S Also, it is well known (see [3 Ch. 12, 

CoroUary 3]) that Hnorm{f,x) > 1. 

The following result (essentially, a 7-Theorem in Smale's theory of estimates for Newton's 
method [TH]) quantifies our claim above (see [B] for its proof). 

Theorem 2.2. Assume f{() = and dp{x,() < ^,3/2^'^° — jj^ where vo '■= 3 — a/? « 0.3542. 

Then x is an approximate zero of f associated with C. □ 

The next result. Proposition 4.1 from [B], gives bounds on the variation of the condition 
number /inorm(/i2;) when / and x vary. 

Proposition 2.3. Assume D > 2. Let < e < 0.13 be arbitrary and C < For all 

f,g e S{H(^d)) and allx,y e C"+\ ifds{f,g) < D^/^f,l^„,^{f.x) anddp{x,y) < j>3/2^_f^^(y 3.) , 
then ^ 

— Mnorm(g, V) < Ainorm(/, x) < (1 + s) ^J.noi-m{9 , V) ■ □ 

1 + £ 

In what follows, we will fix the constants e := 0.13 and C :— — 0.025. 
We also introduce the mean square condition number of q given by 

tA{<i)-=^ E ^^^(g-C)- (2) 

C:<j(C)=0 

2.4 An Adaptive Homotopy Continuation 

Suppose that we are given an input system / G S'('H(d)) and a pair {g^Ct € Vp, where g is 
also in the unit sphere and such that / and g arc R-lincarly independent. Let a = ds{f,g). 
Remark that one can compute a as 

a = 2arcsin(^^^-^^ . (3) 

Consider the line segment Eg j in 'H(d) with endpoints g and /. Wc parameterize this 
segment by writing 

Egj = {qr e H(d) I r e [0, 1]} 

with Qt- being the only point in Egj such that ds{g, qr) — Ta. Explicitly, as remarked in [6], 
we have qr = tf + (I — t)g, where t = t^r) is given by 

tir) = ^4 ■ (4) 

sin a cot(ra) — cos a + 1 
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If E'gj n E = 0, and hence almost surely, this segment can be lifted to a path given by a 
continuous function [0, 1] — >■ mapping r i— >■ (qt-, C,t)- 

In order to find an approximation of the zero Ci of / = qi we may start with the zero 
C = Co of g = go and numerically follow the path (g,-, C,t) by subdividing [0, 1] into points 
= To < Ti < • • • < T/j = 1 and by successively computing approximations Xi of Qn by 
Newton's method. 

This course of action is the one proposed in the Bezout series and further adopted 
in [21 131 m E] . The (infinite precision) continuation procedure is the following (here A = 
gii^ « 6.67- 10-3, see [B]). 

Algorithm ALH 
input /, 5, C 

## (5,C)G^^, /7^,9## 
a := ds{f,g), T := 0, Qr := g 
repeat 

At •= : - 

T := min{l, r + At} 

Qr t(T)/ + (1 - t(T))r7 

X := a;/||.Tj| 
until T = 1 
RETURN X 

Note that the step-length At depends on /inorm(gr , 2;). Hence, the adaptivencss. 

The algorithm MD (Moderate Degree) from [6] is a direct application of ALH having as 
initial pair (g, Q the pair ([/, zi), where U = (Ui, . . . , C/„) S S{'H(d)) with Ui = --^{Xq^ — 

Xf)and2i = ^(l,...,l). 

Algorithm MD 
input / G •H(d) 

run ALH on input {f,U,Zi) 

2.5 Roadmap 

Theorems A and B are proved by designing finite-precision versions of algorithm ALH which 
take into account the errors due to the use of finite-precision. The variable precision version 
ALHVar is described in detail in Section |4l In particular, its main properties are shown in 
Theorem 14.31 in this section. Proposition 14.51 — a finite precision version of the inductive 
proof of [6l Theorem 3.1] — provides the backbone for the proof of Theorem 14.31 

Once with Thcorem l4.3l at hand, the proof of Theorem B is carried out in a more or less 
straightforward manner in Section [5l 

The use of fixed precision poses less demands in algorithmic design (the issue of roimd-off 
unit updating now becoming irrelevant). Algorithm MDFix is therefore a simplification of 
MDVar, which we describe in Section |6] together with the proof of Theorem A. 

As just mentioned, the backbone of all this development is Proposition l4.5l The proof of 
this result relics on finite-precision estimates for the errors in a number of basic procedures. 
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These estimates are collected in the next section. We are aware they do not make the most 
exciting part of the paper but it is a part we cannot do without. 

3 Error Bounds 

In this section we show bounds for the basic computations occurring in ALHVar. Wc will 
use these bounds in subsequent sections to show our main result. 

3.1 Basic facts 

We recall the basics of a floating-point arithmetic which idealizes the usual IEEE standard 
arithmetic. In contrast to the standard model (as in |10| ) we adapt our exposition to complex 
arithmetic. This system is defined by a set F C Q[i] containing (the floating-point complex 
numbers), a transformation r„ : C — > F (the rounding map), and a constant m G M (the 
round-off unit) satisfying < u < 1. The properties we require for such a system are the 
following: 

(i) For any x £¥, r„(a;) = x. In particular, r„(0) — 0. 

(ii) For any x £ C, r„(a;) — x{l + 6) with \6\ < u. 

(iii) For any y G F, the set r~^[y) is measurable in C. 

Property (iii) ensures that the measure Vji described in the Introduction is well defined. 
Because of the enumerability of Q[i], this measure can be seen as a discretization of the 
Gaussian in H(d). 

We also define on F arithmetic operations following the classical scheme 

xoy = ru{xo y) 
for any x,y & ¥ and o G {+, — , x , /}, so that 

o : F X F ^ F. 

The following is an immediate consequence of property (ii) above. 

Proposition 3.1. For any x,y £¥ we have 

x'oy = [x o y){\ 5), \5\<u. □ 

When combining many operations in fioating-point arithmetic, quantities such as 
]^"^j^(l + Si)^'- naturally appear. Our round-off analysis uses the notations and ideas in 
Chapter 3 of |10] . from where we quote the following results: 

Proposition 3.2. // \5i\ < u, pi e { — 1, 1}, and nu < 1, then 

n 

Ylii + Si)"' = i+0„, 

where 

\dn\<ln = - . □ 

I — nu 
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Proposition 3.3. For any positive integer k such that ku < 1, let 9k,0j be any quantities 
satisfying 

1^1 ku ,^ , iu 

Ok<lk^z r- 0,<j, = -^. 

1 — ku I — ju 

The following relations hold. 

1. (1 + + 0,) = 1 + dk+j for some < 7^+,. 

2. 

l + Oj \ l + 0k+2j ifj > k. 
for some \0k+j \ < Ik+j or some \0k+2j \ < lk+2j ■ 

3. Ifkuju < 1/2, then jklj < 7min{fe,j}- 

4. ijk < lik- 

5. 7fc + M < 7fe+i. 

6- Ik + 7j + Iklj < Ik+j- □ 

From now on, whenever we write an expression containing 9k we mean that the same 
expression is true for some 9k, with \9k\ < 7fc. 

When computing an aritlimetic expression q with a round-off algorithm, errors will 
accumulate and we will obtain another quantity which we will denote by il{q). For a 
complex number, we write Error (q) ^ \q — fl(q)|; for vectors or matrices. Error (g) will 
denote the vector or matrix of coordinates | — f 1 (g^ ) | , allowing us to choose various norms 
to estimate this error. 

An example of round-off analysis which will be useful in what follows is given in the next 
proposition, the proof of which follows the lines of the proof of the real version of this result 
that can be found in Section 3.1 of |10j . 

Proposition 3.4. There is a finite-precision algorithm which, with input x,y C", com- 
putes the inner product of x andy. The computed value fl{{x,y)) satisfies 



fi((a;,2/>) = {x,y) +^'riog,ni+iX! 



In particular, if x — y, the algorithm computes fl(j|a;|p) satisfying 

fl(||x||2) = i|a;||2(l + 0pi,g^„^+i)- □ 

We assume that, besides the four basic operations, we are allowed to compute basic 
trigonometric functions (such as sin and cos) and the square root with finite precision. 
That is, if op denotes any of these two operators, we compute op such that 

op(a::) = op(a;)(l + 6), \6\ < u. 

The following sensitivity results will help us to deal with errors in computing trigono- 
metric functions. 
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Lemma 3.5. (i) Let i, 6* G M. Then 



\cos{t + e) - cost] < \e\; 
\sm{t + 9) -smt\ < \e\; 
(ii) Given two reals a and e such that both a and a + e are in the interval [0, 0.8], one has 
I arcsin(a + e) — arcsin(a)| < 2\e\, with \v\ < \e\. 

Proof. 

(i) Observe that 



cost = 2 


sin(t+0 






sin — 


< 2 


e 

sin — 








2 




2 



<\oi 



and analogously 



\sm{t + 9) -smt\ = 2 



cos t + 



<\o\. 



□ 



(ii) Without loss of generality, let us suppose that e > 0. 

From the intermediate value theorem, there exists a ^ in [a, a+e] such that arcsin(a+e) = 
arcsin(a) + e arcsin'(^) = arcsin(a) + e- ^ 



Since ^ e [a,a + e], |^| < 0.8 and thus | arcsin'(^)| < 



< 2. 



□ 



To avoid burdening ourselves with the consideration of multiplicative constants, wc in- 
troduce a further notation. Computational errors in our context arc functions of the integer 
parameters n,N and D as well as on the condition /inorm(5, z) of the initial pair (g, z). For 
any such function we will write 

m ■■= Ooi^)- 

The next properties follow directly from the properties of the 9 notation. 
Proposition 3.6. Let <f> and ^! be two real functions. The following relations hold: 

1. m + m = [max($,*)l. 

2. [$H*l = Imax($,vI/)|. 

3. //$>!, $1*1 = 1$*]. 



3.2 Bounding errors for elementary computations 

We now begin showing bounds for the errors in the crucial steps of our algorithm. To avoid 
burdening the exposition we will do so only for the steps dominating the accumulation of 
errors and simply warn the reader of the minor steps we consider as exact. 

We begin with the evaluation of the errors in computing a. Remark that we suppose 
a < 7r/2 in the following lemma. This will be ensured by the computation of a at the 
beginning of ALHVar. If this quantity is more than tt/2, we set / = — /, ensuring that 
a < it/2. We neglect the errors in this operation, and thus suppose in the remainder that 
a < Tr/2. 
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Lemma 3.7. Given f and g in 5'('H(d)) such that ds{f,g) < 7r/2, one can compute a 
ds{f,g) with finite precision such that 



fl(a) =a(l + [logiVl). 



I/-9I 



Proof. As remarked in ([3]), one can compute a = ds(/, g) as a = 2 arcsin | 

We can compute the norm || / — g\\ similarly as the vector norm in Proposition 13.41 In 
the case of polynomials in 1-1(^^)1 the sum is over N coefficients, and thus we prove similarly 
that f 1(11/ — pIP) = II/ — .9||^(l + 6'[iogAr]+i)- Since we supposed that we can compute square 
root with finite precision, we get 

fi(||/-5l|) = ||/-.g||(l + 0riogA^l+2)- 

Remark that, since we supposed ds{f,g) < 7r/2 and ||/|| = ||g|| = 1, we have ||/ — .g||/2 < 
sin(7r/4) = l/v^ < 0.71. We can suppose that m is smah enough such that the term 6'[-iogAr]+2 
is smaller than 0.8 — 0.71, and thus such that fl(||/ — 5||/2) is also in [0,0.8]. We can thus 
apply Lemma |3.5[ and by supposing that we are able to compute the function arcsin with 

finite precision, we conclude that we can compute a — 2 arcsin ^ 

I/-5 



such that 



fl(a) 



the last line since 



= y2 arcsin 
= 2 arcsin 



2 

ll/-.g|| 
2 



(1 



2 11/ ^ .911 g^^j^^^^ j (l + 6'c)(i)) 



ll/-g|| 



< 



arcsm 



\f-a\ 



□ 



Proposition 3.8. Given r e IR+, / and g in S{H(^d)) such that ds{f,g) < tt/2, we can 
calculate t{T) with finite precision such that 



Proof. 



t{T) 



flit) : 

First of all, observe that 
1 

sinQ!Cot(aT) — cos(a) + 1 



f(l + IlogiV]). 



sin(rQ!) 



sinacos(Ta) — cosasin(TQ;) + sin(rQ!) 
sin(rQ;) 



sin(a 



f sin(TQ;) 
sin(rQ!) 



2 sin 



[1 — Tja+TQ 



sin(TQ;) 



/ {l — T)a~Ta\ 

V 2 J 



2 sin ^ cos ((i — t) a) 

We compute ^(t) via the last equality. First, we compute a following Lemma 13.71 Then, 
we show easily using Lemma 13.51 that each term in the fraction can be computed with finite 
precision up to a multiplicative factor (1 + ^^(logAf))- We conclude using Proposition 13.31 

□ 

The following lemma bounds by ||g|| the value of a polynomial q at any point on the unit 
sphere. 
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Lemma 3.9. Given d G N, q G C[Xo, . . . , X„] homogeneous of degree d and x G S{C"~^^), 
we have \q{x)\ < \\q\\ . 

Proof. Since our norm || \\ on C[Xo, . . . , Xn] is unitarily invariant, for each element 
(l)€U{n + l), one has ||/o(/>|| = ||/||. 

Let eo (1, 0, . . . , 0) G C"+^. Taking (f> such that (/)(eo) = x, one has 

\q{x)\ = |(9o0o0"l)(x)| = |(<7o0)(eo)|. 

But \qo (f>(eo) \ is exactly the coefficient of Xq in qocj) with respect to the Bombieri-Weyl 
basis of C[Xo, . . . ,Xn], and thus \qo (j){eo)\ < \\qo (j)\\ = \\q\\. □ 

Proposition 3.10. Given q e 5'('H(d)) and x e S{C"+^), we can compute q{x) with finite 
precision u such that 

||Error(g(a;))|| = (logN + Dl 

Proof. For i < n, write qi{x) = ^ cjx'^ . To compute qi{x) we compute each monomial 
cjx'^ first, and then evaluate the sum. We have 

fl{cjx-^) = cjx-^{l + ed,+i), 

and thus Error (cjx'^) < |cj||a;|'^7d;+i. 
As 

flfe(x)) = fl(^cjx'^) , 
using pairwise summation (see section 4.2 in |10| ) we have 

Error (g, (a;)) = |^ f l(c,7a;^) - ^(cjx'^ 

+ Y,iHcjx-^W[iog,N]\ 
- X! {cjx-') + |c,/2;-'|7piog^ AT] 

+ X] (cja::"')7riog2 N] 

< 'Y\cj\\x\-'{jD+l +l[log^N] +lD+ll[log^N]) 

< Y\cj\\x[''yiiog^N^+D+i- (by Proposition 6.) 

Note that I^jII^^K — by applying Lemma [3.91 to the polynomial of coefficients 

\cj\, which has the same norm as qi, at the point |a;| G S'(C"+^). Hence, 

Error (^^(a::)) < \\qt\hiiog^ n]+d+i, 

and therefore, 

||Error(g(x))|p < Tfiog^ wi+D+i XI H^*"^ = ifiog^ n]+d+i\\<i\\'^ = ifiog, w]+_d+i- 

i 

We finally have 

||Error(g(x))|l = pogiV + I?]. □ 
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3.3 Bounding the error in the computation of fi^^j.^{q,x) 

The bounds in Error {tJ.^o^j^{q, x)) scale well with q. Hence, to simplify notation, in all what 
follows we assume \\q\\ = 1. 

The main result in this subsection is the following. 

Proposition 3.11. Given q G 5('H(d)) and x e S'(C"+"'^) we can compute ^~^^^{q,x) 
satisfying 

Error {^x^^,^{q, x)) = [nQogiV + D + n)}. 
Note that under the assumption \\q\\ = 1 our condition number becomes 

'1 



Given q e 5(H(d)) and x G 5(C"+i), let Mq G 
operator 



{q,x):= {Dq{x)iTj diag( Vdi, . . . , Vc',, 

be a matrix representing the linear 



1 



Dq{x)\q 



(5) 



in some orthonormal basis of (note that Mq depends also on a;; that point x will always 



be clear from the context). We then have iJ,^^j^^^{q,x) = \\M 



= cTminiMg) where 



denotes smallest singular value. We will compute ^^^j.^{q,x) by computing Mq and then 

(^miniMq). 

The following proposition contains several technical ideas that will help us to deal with 
the matrices Dq{x)\T^ and Mq. We use ideas from the proof of [7] modifying them to the 
complex case. 

Proposition 3.12. Let q G 'H{a) and x G ^(C"^^). Then the following statements are true: 

(i) The restriction of the derivative of q to the tangent space T^ can he represented by the 
following matrix : 

Dq{x)\T^ = Dq{x)H, 

where H G ig tJi^ matrix made with the last n columns of the matrix 

defined by 



^ a{In+i-2yy*) , y 
if \xo \ ^ 1, and Hx = ain+i otherwise. 



(n) 



(Hi) 



diag [ , • ■ • , 



\Dq{x)y < ^\\q\\ 



\\x - aeoW ' 

Dqix)\T^ <\\q\\. 
Dq{x)^TM^<VD\\q\\ 



Xq 
\xo\ 
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Proof. (i) For any unitary matrix such that HxCq = x, the n last columns H of Hx 
form an orthonormal basis of T^. Thus Dq[x)H is the representation of Dq{x)\T^ in that 
basis. 

The matrix computed here is constructed in [3]; one checks easily that it is unitary 
and that it satisfies H^eo = x. 

(ii) Let g = qoHx- Then, differentiating the equality gi(H*x) — qi{x) and multiplying 
both sides by H on the right, we have 



Dg,ieo)H*H = Dq,{x)H = Dq,{x)\T^, 
where the last equality is by (i). Observe that H*H = [ei, . . . , e„], hence, 



— (eo),...,^M 



(6) 



(7) 



1 /2 

If we denote gi{X) — (^) giaX", it is straightforward that 



dgi 



(eo) = 



d, - 1 



1/2 



Therefore, from ([7]), 



.9i(ej+(d.-l)eo) - ydi- 9i{e,+{di-l)eo)- 



(8) 



and hence by ^ we have 
diag ' 



Dg{eo)i 



F ^=l 



j=Dg,{eo)T, 



< T.\\9'^f = \\9\\'- 



(9) 



Since the Hcrmitian inner product associated with the Bombieri-Weyl basis is unitarily 
invariant, we have 

11.911' = (.9,. 9) = {qoHx,qoH,) = (g, = \\qf , 

which by (H]),© and and since the spectral norm of a matrix is not greater than its 
Frobcnius norm, yields 



diag ( 



1 



< 



m = \m 



□ 



The relations (iii) can be shown similarly. 

The following two statements are similar to those proved in in the real case and 
similar ideas are used in the proofs. 

Proposition 3.13. Given q G S'('H(d)) and x G S'(C"+^), we have ||I?(7(a;)|j-^ || < VD, and 
we can compute Dq{x)\T^ with finite precision such that 



||Error(i^g(a;)|Tj||f < [?iVi:'(log iV + + log 
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Proof. The inequahty \\Dq{x)\T^ \\ < vD follows from = 1 and Proposition l3. 12f iii') . 

We compute Dq{x)\T^ as in Proposition I3.12t i). Hence each entry of the matrix 
Dq{x)\T^ is calculated as the product of Dqi{x) and the jth column Hj = {hkj)i<k<n+i of 
H. Proceeding as in the proof of Proposition 13. lOl wc can compute -§y^{x) with 

( 



Error (^^W J = {{log N + d,mDq,ix)\\ p. 

One can compute a as 5° with two arithmetic operations and one square root. 

Observe that to compute x~ aeo, we need to perform only two more arithmetic operations. 
Also, 



iyy*)ii = — r— ((^ - aeo){x - aeo)*).j 



1 

"'^ ll-T-aeoll 
and we have 

fl ((a; - aeo)(x - aeo)*)^^ = tl (^{x - aeQ)i{x - aeo)^^ 

= {{x~aeo){x-aeo)*)^j{l + 0ii). 

Further, 

f 1 (llx - aeoll^) = f 1 XiXi + (xo - a)(xo - a)^ 

Here we used pairwise summation bounds again. 
Thus, by Proposition 13.3( 2') . 

f 1 {2yy*)^^ = {2yy*)^^ (1 + ^ariog, («+i)l+35)- 

Finally, taking into account one more addition and the multiplication by a, we get 

Error (/ijj) = eizriog^ (n+i)l+39 = l^ogn}. 

Applying Proposition [331 conclude 



Error ([i?g(x)|Tj.,) - \fli{Dq,ix), H,)) - (Dq^ix), Hj)\ 
= \{iliDq,ix)),fliH,)) 

k 

< \{ETTOT{Dq,{x)),H,) \ + \{Dq,{x),ETTOT{H,))\ 

+ 1 (Error (Dg^ (x) ) , Err or (i/j )> I + 7riog2 n] +i I Dq, {x)\\Hj\ 

= ([(logA^ + i?)] + lV^lognl 

+ [(log iV + I log n] + I log nl ) 1 1 Dq, {x)\\f 

= iV^ilogn + log N + DmDq,{x)\\F. 

This implies 

||Error(i?g(x)|Tj|lF = ln{\ogn + D + \og N)}\\Dq{x)\\F = [n\/;D(logn + i^ + logiV)] □ 
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Proposition 3.14. Given q e Sin^d)), x e 5'(C"+i) and M, defined by we have 
< 1. In addition, we can compute such a matrix Mq with finite precision u such that 

||Error(A/g)||F = |n(log + D + logn)]]. 

Proof. The inequality \\Mq\\ < 1 follows directly from Proposition 13. 12f ii) . as — 1. 
Floating-point errors can be evaluated exactly as in Proposition 13.131 however, one gets rid 
of the factors VZ? since the bound on ||Mg|| is better than the bound on ||£'(7(a;)|7;. j| . As 
a counterpart, one has to take into account one more division by ^/di of each entry of the 
matrix, which slightly changes the constants, but leaves the order in N, D and n unchanged. 

□ 

Proof of Proposition 13. iTl We use ideas from the proof of an analogous proposition 
in [7]. Let x € 5(0"+^), q G S(H(d)) and Mq be as in Proposition EH Then n~^^^{q,x) = 
o'min(Afg) = |jM~^||~^ and we can compute the first expression by computing the last. 
Let E' = Mq - fl{Mq). By Proposition IXTil 

ll^^'ll < II^^'IIf < H\ogN + D + \ogn)l 

Let Mq = ±l{Mq). We compute crmin(Alg) = ||Af~^||^^ using a backward stable algorithm 
(e.g., QR factorization). Then the computed f l(CTniin(.^<;)) is the exact cr^^^^Mq + E") for 
a matrix £"' with 

\\E"\\<cn'u\\Mq\\ 
for some universal constant c (see, e.g., [9l[T0]). Thus, 

ilian,in{Mq)) = f l(cTmin(X?)) = Crr,,in{Mq + E") = a,„i„(M, + E' + E"). 

Write E = E' + E". Then, using < 1 (by Proposition EH]), 

ll^ll < ll^'ll + \\E"\\ < \\E'\\ + cn\\\Mq\\ < \\E'\\ + cn^u{\\AIq\\ + \\E'\\) 

= |7i(log N + D + log n)] + [n^]] (1 + [n(log iV + D + log n)i ) 

= |n(log + D + logn)i + [n(log7V + D + n))} 

= ln{logN + D + n)l 



using Proposition 13.61 in the penultimate row. 

Therefore, fl{a,^in{Mq)) = a-min(M, + E) which implies by [9l Corohary 8.3.2]: 

Error (crmin(A/g)) < \\E\\ < [n(logiV + D + n)]. □ 

3.4 Bounding the error on the Newton step 

We next evaluate the error in the computation of a Newton step. Our result is the following. 

Proposition 3.15. There exists a universal constant e > such that given a system q G 
5('H(d)) and a point x G S{C"'^^), if the precision u satisfies 

e 

u < 



D-^^j:i^^,-^{q,x)n{D + log N + n^y 
then the error Error {Nq{x)) satisfies 

||Error(7V,(a;))|| ^ C{1 ~ e) 



< 



\\Nq{x)\\ - 27r(l + e)2L>3/2 

/^norm 

where C and e are the constants introduced in Proposition \2.3\ . 
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q{x) 




We compute Nq{x) — a; by solving the linear system 
We denote D, = 

Recall the following result from (TUl Chapter 7] (in fact, Theorem 7.2 therein applied to 

f = b/\\b\\ ■And E = A/\\A\\). 

Lemma 3.16. Given a linear system Ax ~ b, approximations A' of A and b' of b such that 
\\^~^'\\ ^ ll^^^'ll ^ £ anrfe||^~^|| < 1, the solution x' of the perturbed system A' x' = b' 
satisfies : 

II-'- -II <T^^^(i + 11-11)- ° 

Furthermore, from [lOl Chapter 18], the solution ir of a linear system Ax — b, where A 
is non-singular, computed with a QR factorization with finite precision satisfies 

{A + AA)x = b + Ab, (10) 

where \AA\ < n'^jcnG\A\, \Ab\ < n'^jcnG\b\, \\G\\f = 1, c e E. Here, \A\ denotes the matrix 
with entries |ay | and the same for |A^|, \b\, and |A6|. 

Lemma 3.17. Letq e S{'Hd) andx G S'(C"^"'^). We can compute Ng(x) with finite precision 
such that 

ll^gWll 

q{x) 



Proof. To simplify notations, in this proof, we write q{x) instead of |^ J ' ^ 

denote our computed solution of Dgy = q{x). 

From (Uni), f l(y) is the solution of a system {ADg + fl{Dq))y = Aq{x) + fl{q{x)) with 
\\AD,\\ = [rrW] and \\Aq{x)\\ = ln% 

From Proposition 13.101 given q S S{Hd) and x E 5(C"+^), we can compute q{x) with 
finite precision u such that jjError ((7(a;))|| — + logA^|. Obviously, ||Error (Z?(j)|| is not 
greater than the bound we computed for ||Error (/^^(a;)/^)!! in Proposition 13.131 Hence, 
Error (Dg) = (ny/D {\ogN + D + logn)j. 

Furthermore, from ||AD,|| = {n^y/Dj and \\Aq{x)\\ = fn^. 

Finally, both terms ||AZ)g + Error (Z?g)|| and ||A(7(a;) + Error (q(.T))|| can be bounded 
by an expression e = ln\/lj{\ogN + D + n^)]. 

Thus, from Lemma 13.16) the error on y is bounded as 

Error (y) < {1 + \\y\\) 



= (l + l|y||)lM norm 

(q, a;)nV D{D + log N + n )]] , 

the last line since ||-D^^|| = 0(//norm(g, a;)) and < j4k- 

Since Nq{x) belongs to the tangent space to the unit sphere T^, ||A^g(a;)|| > 1, and 

1 + ||y|| = 1 + ||iV,(x) - x|| < 1 + ||iV,(a;)|| + ||a:|| < 3||^,(x)||. 
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Hence, 

llError {y)\\ = 3\\N,{x)\\ Ifinon-niq, x)n^{D + \ogN + n^)\. 

Then, the computation of Nq[x) from y = Nq{x) — a; is a simple addition and does not 
change the order of the errors. □ 

The proof of Proposition 13. l51 is now immediate. 
3.5 Bounding the error for Ar 

We evaluate here the errors in the computation of the quantity At, that is, the size of the 
current step in the homotopy. 

Proposition 3.18. For x E ^(C"^^), and f,g,q G S'(?^(d)) such that ds{f,g) < 7r/2 define 
the quantity 

A 

ds(/,.9)£'^/2MLrm(9,a;)" 
There exists a universal constant f > such that 

f 

" - n(log N + D + n)/xg„^(g, x) 

implies 

Error (At) < —At. 

4 

To prove this proposition we rely on the following lemma. 

Lemma 3.19. Given x G 5(C"^^) and q G 5'(H(d)), one can compute a'^i^{Mq) with finite 
precision u such that 

ErrorKiJA/,)) = {n{\ogN + D + n)}. 

Proof. By Proposition 13. Ill Error (crmin(A/q)) = |7i(logiV + D + n)]]. Hence, we have 

I f l(a^i„(M,)) - < 2|a„,i„(Af,)|In(logiV + D + n)} + In(logiV + D + n)f 

< ln{\ogN + D + n)l + ln{logN + D + n)l 

since, by Proposition EH |crmin(Mq)| < < 1. Thus, 

Error {al,M)) = [n(log N + D + n)} . □ 

Proof of Proposition 13.181 One has 

fl(AT) fl 



aD3/Vnorm('?>2;) 

+ 

Xfl {al,jMq)) 

+ '^oiiogN)), 
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the last equality being from Lemma 13.71 and thus by Lemma 13.191 

Error (Ar) = _A_(|„(log7V + + n)l + |log7Vl) 
^ 'n{\ogN + D + n)j. 



If u satisfies with a value of f small enough, the term |n(log + D + n)] may be 
bounded by 

H\ogN + D + n)}< \ 

and consequently 

Error (At) < —4 ; r = -Ar. □ 

3.6 Bounding the distance between qr and 

We evaluate here the error in the computation of q-r, given /, g, and r. 

Proposition 3.20. There exists a universal constant g such that the following holds. Let 
f,g G S'('H(d)) with ds{f,g) < §(1 + 1/6) be given with roundoff error u. Let t € [0,1]. 
Then for all A e (0, 1), 

u < 

~ logiV 

implies 

II fl(g,) - QrW < A. 

We first bound the distance between the points t/ + (1 — t)g and t f l(/) + {I — t) il{g), 
without taking into account the error in the computation of t. 

Proposition 3.21. Assume that f,g,f,g G S{TL(d)) are such that ds{f,g) < ^(1 + 1/60) 
and\\f-f\\ < 1/60, < 1/60. Forte [0,1] define q = tf +{l-t)g andq = tf+{l-t)g. 

Then 

<2max{||/-/||,||g-.9||}. 
To prove Proposition 13.21] we rely on the following lemmas. 
Lemma 3.22. Let f,g e H(d) with ||/||, ||.9|| > a > 0, ||/-,g|| < P with a > /3/2. Then 

!(/,..) I (12) 
2a^ 



Proof. Pick any f,ge H(d) with ||/||, ||g|| > a, < /?, denote r = {f + g)/2, and 

let s G H(d) be such that ||s|| = ||/ — ff||/2, s _L r. Then from the orthogonality of r and s 
we have 



ii, + ,||^ ^ ||, _ ,f = + = m±ML > > 

also, 

||(^ + s)-(r-s)||-2||s|| = j|/-5||</3. 
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Therefore, 



\{r + s,r^s)\ ^ \\\rr-\\sr\ ^ |||/ + gf - ||/ ^ ffH^I ^ 



+ - ll/illl,9il 4|l/|l||g|| - ||/!||l.g||- 

Since \\r + s\\ = \\r — s\\, we have 

. \{f:9)\ . l(/,g)l 

"^1^ II r = mm (13) 

ii/ii>ii9ii>° n g ii/ii=iisii>c f (7 

||/-g||<^i II-' II 11-^11 ||/-9||</3 II-' II 11-^1 

Now assume that /, g G 'H(d) with ||/|| = \\g\\ > a, \\f — g\\ < /?. Let 



f-g ^ V4«' - _ f + 9 



2 2 \\f + gr 

, 13 g-f ^ v/4a2 - f + g 
9 - 



2 11/ -.911 2 \\f + gW 

It is not difBcuh to check that ||/'j| = j|g'|| = a and ||/' — g'\\ ~ /3. Moreover, 

i(f ,ff'>i ^ 1 _ < wfw' + yr 11/ -gf < \{f,9)\ 



2a^ - 2\\f\\\\g\\ 2\\f\\\\g\\ " \\f\\\\gW 
Therefore, 

. |(/,g)l _ . \{f,9)\ .... 

ii/ii=ii9ii>ni/iiiigii 11^11™'^=° 11/11 iigir 

||/~b||<3 II-* iiii^ii II/~s!I=3 iiii^ii 

From and (fTi)) we have 

. l(/,g)l . MM>i 

mi'^ II J = mm ,|„,||| , > 1 - — ^, 

Ii/ii.ii9ii>° / g ii/ii=ii3ii=° / g 2a^ 
ii/-gii<3 II-' iiii^ii ii/-sii=3 II-' iiii^i 

which shows □ 



Lemma 3.23. Let f,g E ^(d) 11/ — g|| mmlll/llj l|g||}- ^/len 



ds(/,g)<4- (15) 
V3 mm{||/||, llgill 

From Lemma 13.221 we have 

where (3 = \\f — g\\, a ~ min{||/||, j|g||}. From the Taylor expansion for cos we obtain 

cosds(/,g) < 1 + — , 

therefore 

ds (/,g) + ^ >0. 
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Solving the relevant quadratic equation for ds^lf^g), we have 



By our assumption /3/a < 1, therefore, 



6(l + \/l-|^K^ • (16) 



6|l + ^/l-^ 



2 



and the interval on the right-hand side of ([T5)) is irrelevant (as ds{f,g) < tt). We have (using 
/3/a < 1 again) 



^ + V 3^ 



which yields ([151). ^ 

Lemma 3.24. Let f,g G 'H(d), Hffil = ||/|| = 1, a?icJ ds{f,g) < §(1 + (5). T/ien, given 
t e [0, 1], q{t) =</+(!- t)g satisfies 



Proof. Consider the function : K E defined as follows: 

^(t) = = ||.gf + 2i5R(.g, / - .g) + t^H/ _ g||2. 

Observe that mintgR ip{t) is attained at 

^ 2SR(g,/-.9) ^ h-fr + hr-wir ^ i 

211/ 2|l/-g||2 2' 
and (^(i*) = + We then have 

1„.,„„2_. d^{f,g)^^ ^'{l + 5f 



Mt)r >-A\f+9r = i- ^^v^ > 1 - > 1 - 



411. ^1, 4 - 4 - 16 ' 

which gives us (fT7| . □ 
Proof of Proposition 13.211 Observe that 

=l|i/ + (l-i),9-t/-(l-05ll 
< ill/ -/II + (1-011.9-311 
<max{||/-/||,||g-5||}<i^. 

From Lemma 13.241 applied with 5 = ^ we have 
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and hence 

||g1l>lkl|-||g-g1l>^-i/60 = ^. 

Now applying Lemma 13.231 we have 



:{!l/-/!MI.9-5ll} r ^ . 
^ ^ <2max|||/-/||,||.g-5||} 



2 \\q-q\\ 2_ ina^^- 

V3 min{||g|U|g1|}- 

Proof of Proposition 13.201 Let us denote / = f l(/), g = f l(g) and t = f Let 
Qt- denote f l((j'r) and g,- the system if + (1 — i)g. By hypothesis, both ||/ — /|| and \\g — 7]\\ 
are not greater than u. 

Thus, by Proposition [MD if u < 1/60, 

||?r - qrW < 2w. 

From Proposition [3]8l t = t{l + OQ(^iogN))- Thus, there exists a constant g such that for 
allvle (0,1), M< 1^ imphes 

Mr-qrW < A/2. 

Taking g < 1/60, u < j^jy- ensures 

~ ~ 5 

Ikr - qrW < hr - ?rl| + ||?r - QtW < -A < A. □ 

3.7 Estimates for u 

Along our homotopy, the precision needed to guarantee correctness varies with the system 
q considered. In our variable-precision algorithm we will want to keep this precision at all 
times within the interval [■^'B{q, x) ,'B[q, x)] with 

nD^{\ogN + Dln^)f,l,,^{q,x)' ^^^^ 

where k2 is a universal positive constant (that will be specified in Definition I3.28P . 

Now, since q and x vary at each iteration, one has to update the precision as well. 
To do so one faces an obstacle. When computing u we actually obtain a quantity f 1(m) 
which depends on the current precision and this current precision has been computed in the 
previous iteration. Proposition 13.251 below shows that this obstacle can be overcome. 

Proposition 3.25. If u < {I + e)^B{q,x) then 

Error (^^B{q,x)^ < ^B{q,x). 

In particular, when computing u := |B((7,x) the computed quantity satisfies fl(M) € 
[iB(g,x),B(g,x)]. 

Towards the proof of the proposition above we define 

^^'^'''^ ^= nD^logN + D + n^)f,l,,^{q,x) 
so that B(g, x) — k2B{q, x). Our first lemma bounds the error in the computation of B{q, x). 
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Error x)) < — j. 



Lemma 3.26. There exists a positive universal constant such that the following is true. 
Assume u < 2fc3rt(iog jV+_D+n) • Then for any x £ S{C^'^^) and q e 5'(H(d)), one can compute 
B{q, x) such that 

Proof. From Lemma [3.191 we can compute o-f-^^-^^^iMg) with error |n(log + D + n) 
we can compute i?((7, a;) such that 

n_D^(log A* + D + n'^j 

and thus 

Error H^o,N + D + n^)l 



nD'^(logN + D + n^)' 
It foUows that there exists a constant such that 

Error (Bfa x)) - ^^^^(log w+Z3+n^) 

Thus, when u < 2k3n(iogN+D+n^) ' denominator in 7fc3n(iog Ar+i>+n) is greater than 
1/2, and 

Error(i?(q,a;))< ^. □ 

Corollary 3.27. Let k2 be a positive constant such that k2 < jj^^fj:fe)^' '^^^ condition 
M < (1 + e)^k2B{q,x) ensures that 

Error ^k2B{q,x)^ < ^k2B{q,x). 

Proof. If u is less than or equal to {l+e)^B{q, x), since /inorm(<?, x) is always greater than 
1, if we choose k2 not greater than 2k3{i+s)o ' " ^^^^ ^^^^ than or equal to 2fc3n(iog iv+D+w^) • 
Thus, one has 

Error <^< ^2 B{q,x). 

Taking k2 < ^^k^li+eY ^^"^ ^^^^ Error (|fc2i3(g, .t)) < Error (fc2i?((?, x)) < jk2B{q,x). □ 

We now have all the conditions that the constant k2 must fulfill. 
Definition 3.28. Let k2 be a positive constant chosen small enough such that 

(i) k2B{q,x) is smaller than the bound on u in Proposition 13 . 18l 

(ii) fc2-B((7, x){l + e)** is smaller than the bound on u in Proposition 13. l51 

(iii) k2 < ; where g is defined in Proposition l3.20l 

(iv) ^2 verifies the condition in Corollary 13.271 

The first three conditions ensure that the precision will be good enough for the com- 
putation of the values of At, of the Newton operator and of qr. The fourth condition is 
needed for the computation of B{q,x) itself. 

Proof of Proposition 13.251 From (fT8|) . the bound B(g,a;) equals k2B{q,x) and the 
result now follows from Corollarv 13.271 □ 
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4 Analysis of the Homotopy 



We next describe with more detail our procedure ALHVar — Adaptive Linear Homotopy 
with Finite precision — to follow the path {(^t, Ct) | t S [0, 1]}. 

All the certifications on an execution of ALHVar will be for inputs satisfying certain 
conditions. We thus define the notion of admissible input for ALHVar. 

Definition 4.1. An admissible input for algorithm ALHVar consists of 

• A function read_input ( ), that returns an approximation of a system / G S'('W(<j)) 
with the current round-off unit. That is, the instruction read_input / ( ) returns a 
system /' such that the coefficients a'^ of the polynomials satisfy 

\a'a - «a| < u\aa\, 

where Oq is the coefficient of the monomial of the same degree a of fi. In particular, 
this implies that 

II/- /'II < "11/11- 
Note that read_input f( ) is not required to be computable. 

• An auxiliary system g G 5'(H(d)), supposed to be given exactly. 

• An approximate zero x E 5(C"+^) of g satisfying 

for its associated zero (. 

• An initial round-off unit u G R_|_ such that 

u < B{g,x). 

For clarity, we denote such a tuple {f,g,x,u) and we refer to it as an input to ALHVar 
even though / is not given directly and the precision u is not passed as a parameter (it is a 
global variable in MDVar). 

Define A := « 5.37 • IQ-^. 



25 



Algorithm ALHVar 
input (/, g, x) 

f := read_inputj( ) 

if dsil, g) > f then g := -g 

T := 0, Qr ■■= g 

repeat 

At:=^ ^ 

T := min{l, r + Ar} 
/ := read_inputj( ) 

9r :=t(r)/ + (l-i(T))5 

rv. ■ X 

u := jB{qr,x) 
until T = 1 
RETURN X- 



Remark 4.2. The algorithm ALHVar is a finite-precision adaptation of the algorithm ALH 
in [6]. It has a slightly smaller stepsize parameter A. By the parameter / given to ALHVar, 
we mean, that the algorithm is given as input the procedure read_input j that returns finite 
precision approximations of /. 

We may use ALHVar to define a finite precision version MDVar of MD. 

Algorithm MDVar 
input / e 'H(d) 

hr, 

u := 



nD2(log Af+D-|-n2)2(n+l)" 

run ALHVar on input {f,U,Zi) 
To a pair / e S{'H[d)) and {g, C) G Mp we associate the number 

fJ-*{f,9,C) max Mnorml^r, Cr)- 

re[0.1] 

Theorem 4.3. Let {f,g,x,u) he an admissible input o/ ALHVar. Then: 

(i) If the algorithm ALHVar stops on input (f,g,x), it returns an approximate zero of f . 

(ii) Assume ALHVar stops on input (f,g,x). Then, the number of iterations K{f,g,x) 
performed by ALHVar satisfies 

K{f,g,x)<B[f,g,0 + B[-f,gX) 

where 

/■I 

tJ^Lrn,i<lT,Cr)d7 



B{f,g,(:) := 408ds(/,g)i?3/2 f ^ 

Jo 



26 



Consequently the number of performed arithmetic operations costALHVar(/j .9, 2:) is 
hounded by 

COStALHVar(/, 9, x) < 0{N) {B{f , g, + B{-f, g, 0) . 

//ALHVar does not stop then either B{f, g,C,) or B{—f,g,(^) is unbounded, and either 
the segment Egj or Eg_-f intersects E. 

(Hi) Furthermore, the finest precision u^{f,g,x) required during the execution is bounded 
from below by 



u*if,g,x) = n 



1 



nD^i\ogN + D + 72^)^ilif,g,0 



4.1 Bounding errors in the homotopy 

We begin with a simple consequence of Proposition 12.31 



Proposition 4.4. Assume D > 2. Let po,pi E S{H(^d))) 1st C, be a zero of po, and A a 
positive constant not greater than C such that 

, . ^ A 

ds[Po,Pij < 



Then the path Epg,p-^ can be lifted to a path in starting in (po, C)- addition, the zero x 
of pi in this lifting satisfies 

dACx)< 



i'^/^MnormbljX)' 

Finally, for all pr G Ep^^p^, if Ct denotes the zero of pr in this lifting, we have 

Mnorm Mnorm iPrXr) < (1 + ^ ) Mnorm 



l + e 



Proof. For each r e [0,1], let Pt be the point of the segment [po,pi] such that 
rfsbcPr) = Tds{po,pi). 

Let T* be such that /J"* /Xnorm(PT, Ct)||Pt|Mt = D3/2fi„t,nipa.C) ' ^ ^' ^^^"^ ^^^^ 

Epa,pi cannot be lifted to V beyond t*, whichever is the smallest. Then, for all r G [0,t*], 
using that < nor in (Pr,Cr) IjPrl! (cf. §12.3-12.4]) we have 

(PsXs) \\Ps\\ds 

Jo Jo 
A 

< 



D^/'^Hnoi-m{PO,C) ' 



It is therefore enough to show that = 1. Suppose to the contrary, that < 1. 
Since ^norm(Pr,Cr) > 1, for cvcry T, 

A 

ds(po,Pr) < ds{po,Pl) < 



£'^/^Mnoim(PO,C) 
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Since A < C the bounds on dsipQjPr) and dp(C, ^t-) allow us to apply Proposition 
and to deduce, for all r S [0,t*], 

Mnorm(PO, C) ^ flnormiPr Xr) < (1 + e)Mnormbo, C)- (19) 



1 + e 
We have 

A 



and thus 



= / Mnorm(PT,Cr)||PT|MT (by definition of T* ) 
Jo 

< (1 + e)Ai„orm(Po, C) / ' IIPrlMr (by m) 

= (is(po,PrJ(l +£)/inorm(po,C), 



which leads to a contradiction with < 1. and finishes the proof. □ 

The next proposition puts together many of the results obtained thus far. The general 
idea for its proof closely follows [51 Theorem 3.1] (which in turn is a constructive version of 
the main result in |12| ) making some room for errors. 

Let {f,g,x,u) be an admissible input for algorithm ALHVar. 

Let Q = tq<t\<T2< . . ., X ~ ajo, X]",X2, . . ., and uq,ui,U2, ■ ■ ■ , be the sequences 
of r-values, points in S'(C""'"^) and precisions generated by the algorithm ALHVar on the 
admissible input (/, g, x, u). Let fi be the approximation of the input / on the ith iteration. 

Let Egj be the path with endpoints g and /. To simplify notation we write Qi instead 
of Or- and Q instead of Similarly, we denote by qt the computed approximation of qi 
— that is, (ji = f 1 {t{Ti)fi + (1 — t{Ti))g) — , by Xi+i the exact value of Nq.{'Xi), and by r^+i 
the exact value of t7 + Ar. 

Proposition 4.5. Let [f,g,x,u) be an admissible input for ALH\/ar. Let k be the number 
of iterations o/ ALHVar on input {f,g,x,u) — that is, either k = oo or Tk = I, qk = f ■ 
With the notations above, for all i S {0, . . . ,k — 1}, the following inequalities are true: 

(J 

a) dp(x7, C,i) < 



B{qi,xi) ^ _ ^ „ ,~ 
u) <Ut<B{qi,Xi) 



x) ds(gi,9i)< 



-D^/^Mnorm(gi,Ci) 

C(l-e) 



c) dg{qi,q^+i)< 



12(l + e)Z?3/2^norm(%,C») 

(l-e)C 



2{l+e)Dy^fInor.^{q,,Q) 

(l-e)C 



d) qi^i has a zero Q+i such that d-p{(^i, Q^i) < 

e) qi+i has a zero Q-^-i such that dr(jXi,C.i+i) < 



2{l+e)Dy^Hno,m{q^,(^) 

C ((l + e) + 7/12(1 -e)) 



D^^^ ^inorm{qi+l , Ci+l) 

Inequalities (a), (u), and (x) hold for k as well in case k < oo. 
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Proposition l4.5l Duts together all the needed bounds to ensure the proper work of ALHVar. 
Statement (a,z) ensures that Xi is "close enough" to Q. That is, x7 is not just an approximate 
zero of Qi, but also an approximate zero for polynomials in a certain neighborhood of qi on 
Eg J. Statements (c,i) and (d,i) show that (taking into account computational errors) 
our step along the homotopy is so small that the next polynomial g^+i belongs to this 
neighborhood. We hence arrive at (e,i), which essentially means that is an approximate 
zero of Qi+i associated with Q+i- Therefore, the Newton step (with computational errors 
accounted for) brings the next iterate Xi+T close enough to C-j+i to ensure that (a,i + 1) holds 
again. Making sure that (u) holds on every iteration, we guarantee that computational errors 
are small enough to allow all the other steps of the proof ((a), (c), (d) and (e)) to be carried 
through. 

Proof of Proposition 14.51 We proceed by induction by showing, that (a,i), (u,i) 
and (x,«) imply successively (c,i), (d,i), (x,i + 1), (e,i), and finally (a,i + 1) and (u,i + 1). 

Inequalities (a) and (u), for i = hold by hypothesis, and ((x), 0) is obvious since 
9o = 9o = 9- 

This gives us the induction base. Assume now that (a), (u) and (x) hold for some 
i<k-l. 

We now show (c,i) and (d,i). 

Observe that together with (a,i) and (x,i). Proposition 12.31 implies 

norm 

{q^,x-). (20) 

(1 + e) 

By (u,i) and Definition 13. 281 our precision satisfies pT|) for the pair {qi,Xi). Therefore, 
by Proposition 13. 181 and the definition of At in ALHVar we have 

a{Ti+T -n) < Q;(Error (At) + t^+i - tT) 
5._/ Ml + 3) 



< a-AT < 



4 Dy^fil^^„^{q^,m)' 

So, using (|20p and since A := ^j^tjt, we obtain 

, C(l - e) C(l - e) 

dsiqi,qt+i) = a(n+i ~ n) < — — — „ < 



2(l+£)W2^Lrm(9.,^) " 2{1 + e)^ D^^ f^l^.^iq,, Q ' 

Since /inorm(9j, CO is always greater than or equal to 1, (c,i) holds, and (d,i) is the direct 
consequence of Proposition 14.41 applied to {qi,qi+i) and Q, with A ~ ^(i+e) • 

This application of Proposition 14.41 furthermore ensures that, for all t G [t^, Ti+T], 

/Xnorm(g»,CO ^ ^^^^^^^{q^ ^ (^) < (1 + e)Ainorm(gi, G), (21) 

1 + e 

and. in particular, 

Mnorm 



1 norm 

(ft, CO- (22) 
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Since u < B((7j,Xi) and from Definition 13.281 we can apply Proposition 13.201 with A = 



g(l-e) 



and we get 



~ ^ . Cil-e) 
ds(qi+i,qi+i) < 



and, hence, using ([22]) . 



rfs(g.+i,g:,+i)< n3/, . V (24) 



Since /inorm(gi+i, Ci+i) > 1 this shows (x, i + 1). 

We can now use (x, i), (c, i), and (|23l) to bound ds{qi,qi+i) as foUows, 



< 



C(l^£) C(l~£) C(l-£) 

12(l+e) 2(l+£) 12(l+e)a 



< 



Cjl-e) 
(l+e) 



-D^/^A'norm('?i,C») 



< (25) 

the third inequahty using /^norm (ft , Ci ) ^ 1 ^-iid the last from (|20p . We can similarly bound 
distances between zeros and their approximations. Indeed, using ([M]) . Proposition 14.41 ap- 
plied to {qi+i,qi+i) and Q+i, with A ~ ^^^xj-^i ensures the existence of a zero Ci+i of ft+i 
such that 

rfp(C.+i,Cm)< ^^„3,,^^^''^ ^ (26) 

12Z;'^/^/i„orra(ft+l, Cj+i) 

Next we use the triangle inequality to obtain 

dp(x7, 5+1) < (ip(x7, Cj) + fip(C:, Ci+i) + c?p(Ci+i, 5+i) 

^ T^2 V^ + 7W^ T ^ (by (a,z), (d,z) and ®) 

C(l + e+-i^(l-e)) 

^ 7W2 / . V (byfESD) 

£''''^Mnorm(gj+l,Ci+l) 

which proves (e,i). 

Note that (x,i+l) and (|26p . together with Proposition l2.31 imply that /inorm(9j+ij Ci+i) ^ 
(1 + e)A*norm('7i+i, Ci+i)- Also, that we have C(l + £)(1 + s + j^(l — e)) < i^o ~ 0.3542 and 
hence dp (si, 6+1) < -^nz ^ " — Wc can therefore use Theorem 12.21 to deduce that 



Xi is an approximate zero of g^+i associated with its zero Ci+i- Therefore, x^+i = Nq.^-^(xi) 
satisfies 

^ t 7 A (-7 \^ C(l + £+ j^(l-£)) 

dp(xi+i,0+i) < - dp (a;,, 0+1) < ^tt^tt^ r r, (27) 

2 2Z;'^/^/^norm(gi+l,Ci+l) 
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where the last inequahty is due to (e,i), and thus 

^ |C(l+e+^(l-e)) + ^C(l-e) 



D^^^ fJ-noimiqi+l, Ci+l) 

(28) 



C(i(l + e) + |(l-e)) 



Now wo arc ready to prove the last two implications. We first show (a, i + 1). 
Inequality (f25|) allows us to use once more Proposition 12.31 to deduce 

{qi+i,Xi) < (1 + e)n 

norm 

{q^,xl). (29) 



1 + e 



Since Ui is less than or equal to 'B{qi, Xi) (by (u,i)), from the choice of the constant k2 
in Definition 13 . 28r ii) one has 

_ p. 

Ui < 



(1 + e)2„D2(log7V + D + n^)fj.l^^^iq,,x-) 

- nD^logN + D + n^)pl,,^{q,+,,xl) 

The condition on u (for the pair of Proposition [3T5] is thus verified, and applying 

this proposition we obtain 

llx^ II = Error (iV,.,,(a^)) < 4.(1 + e)2S/CoL(?.-M, ^) ' ^''^ 

The proof of (^5]) imphcitly shows that ds{qi,qi+i) < ^3^2^^^"^^-*^ ^ . Together with 
(a,i) we are in the hypothesis of Proposition 12.31 and we can deduce 

/^norm 

{qt+l,Xi) < (1 + e)// norm 



l + e 

This inequality, together with p2|). yields 
1 



/-^norm Mnorm (^j+IjG+i) (31) 



(l + e)2 

and using these bounds (PH]) becomes 

||5^-a;.+i|| < , n3/2^^^T^ r V (32) 
We now use this bound and the triangle inequality to bound dp{xi-^.i, Q+i) as follows 
dr{xT+T,Q+i) < dp{x^,Xt+i) + dp{xi+i,Ci+i) 

TT 

< ^ , + f^3',r"f"r» (byB.ndC8) 

8L''^''^//norm('7i+l, Ci+l) 2L'-^/ ^/tnorm (^j+l , Q+l ) 
C(|(l + £) + |(l-£) + |(l-£)) _ C 



-D3/2/i„o,.„i(qi+i, (i+i) -D^/^Mnorm(9i+l, G+l) 
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which proves (a) for i + 1. 

It remains to show (u, To do so note that we may use (a,i+l) and (x, together 

with Proposition 12.31 to obtain (PU)) for i + 1 (just as we obtained it for i). Consequently, 

norm 

norm (qiXi) (by (EH)) 

< (l+e)Vnorm(9»,^) {hj ■ 



Using this bound afong with (u,i) we obtain 
Ui < B(q;;,x7) = 



fc2 



< 



711)2 (log N + D + n2)^2^^^(g,, a;,) 
k2{l + ef 



7iD2(log AT + D + 7j'^)fj.l^^.„^{qi+i,Xi+i) 



We can therefore apply Proposition 13.25] with the pair {qi+i,Xi+i) to deduce that 
Error (|B(g;+i,x~jT)) < jB{qi+i,x-^), and consequently 



Ui+i - -B{qi^i,Xi+i) 



< -^'B{q^+l,x,+l), 



which proves (u,i + 1) 



□ 



4.2 Proof of Theorem [4731 

(i) Since {f,g,x,u) is an admissible input for ALHVar we can use Proposition 14.51 (and 
the notation therein). The estimate dpllxkXk) < o^'/^fi (q^ (k) ^'^^^^ i^j^) in that 
proposition implies by Theorem 12.21 that the returned point xk is an approximate zero of 
qk = f with associated zero Ci. 

(ii) The first instruction in ALHVar swaps / by — / if ds{f,g) > f • The reason to do so 
is that for nearly antipodal instances of / and g the difference ds{f, /) may be arbitrarily 
magnified in (is(gT, $t)- This does not occur under the assumption of infinite precision and 
this is why such swap is not in the algorithms described in [H [6] . 

Let h be either — / or / (according to whether ALHVar did the swap or not), K{h,g,x) 
be the number of iterations performed by ALHVar, and {((?r, Ct)} be the lifting of the path 

Eg,h- 

Let k < K{h,g,x) be a positive integer and consider any i g {0,...,fc — 1}. Using 
Proposition 14. 41 for qi,qi+i together with ([20|) implies that, for all r G [tT,?^^!!], 

fJ'norniiqi, Xi) /- \ ^ M , \2 l~ \ 

7— < Mnorm(q'r, Cr) < (1 +f^) [inomMi ■, X i) . (33) 
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Therefore, 

r-nTT i-tTTT ,.2 :=r-\ 



Mnorm 

^ lri!)^^..3/2X„(,,^) (b.Propositio.ra 
3A 



4(l + e)4aL>3/2' 
If fc = K{h,g,x) < oo this imphcs 



^2 / 3A \ . 1 _ 1 



4(l + e)V aDS/s - 408ai:»3/2' 



which proves that 



K{h,g,x) < 408ds{h,g)D^/^ f f^l^^Jqr, Cr) dr = B{h,g,C). 

Jo 



(34) 



It folfows that the number of iterations K{f,g,x) satisfies either K{f,g,x) < B{f,g,() or 
K{f,g,x) < B{—f,gX)- Certainly — and this introduces a factor of 2 but siniphfies the 
exposition — 

Kif,g,x)<Bif,g,0+B{-f,g,0. 

In case K{h, g,x) = oo (a non-halting computation) it implies that fj,^^j.^^{qr , C,T)dT = oo. 

The bound for costALHVar follows from the 0{N) cost of each iteration of ALHVar men- 
tioned in 



(iii) For i = 1, . . . , fc — 1, due to (u,i), 



_ B{q,,Xi) 
Ui > = \ l 



^nD"^ {log N + D + n2) max^gf^j^,^] /xLrm(aT, Cr) 
the last by ([55]) . The statement now follows from the equalities 

C) = minuj and ^* (/, 5, () = max max , A^norm(gT, Cr)- □ 

i<k i<k r(E[ri,ri+i] 

5 Proof of Theorem B 

We follow here the proof of the corresponding result for MD in [5] and begin by recalling 
two facts from this article. The first estimates the mean square condition number on the 
path when an extremity is fixed. 

Theorem 5.1 (Theorem 10.1 in [S]). For g e S'('H(d)) \ S we have 

E f4(/,g) / M2(9r)rfr') < 818 i?3/2^(n+l)MLx(5) + 0.01. □ 
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The second bounds the condition of U . 

Lemma 5.2 (Lemma 10.5 in 0). The maximum of the condition numbers /imax(C^) ■= 
'^^^z:I/(z)=o{A*norm(C/, z)} Satisfies 

mLx(C^) < 2n maxl(n + if--' < 2 {n + 1)^. □ 

i<n OjI 

The foUowing proposition bounds the maximum /i* (/, ,9, C) of the condition number along 
a path from C,) to / in terms of the number of iterations of ALHVar to follow this path 
and of the condition number finormis, C) of the initial pair. 

Proposition 5.3. Let f,g e S'('H(d)) and C a zero of g. The largest condition number 
(7, C) along the path from {g,Q to f satisfies 

M*(/,.9,C) < (l+£)^(^'«'^Vnorm(.9,C). 

Proof. Write k K{f,g,() and let ^^.i := m ax f^normiqr , Ct) ■ With this notation, 

Te[T-.T-p[] 

we have fi^,{f,g, Q = max Furthermore, (PT|) states that, for all i < k — 1, 

z— 0,...,fc— 1 

< (1 + e)fj. 

norm 

and an immediate recursion yields 

M*(/>5iC)= max < (l+£)''Mnorm(ff, 0- ^ 

ie{i...,fe-i} 

We remark that from the unitary invariance of our setting, for any unitary transformation 
ly e U{n + 1) and any g S 'H{d) and x G P", 

Mnorm(5',a;) = /i„orm O \ ^^2;) . 

Furthermore, for any execution of ALHVar on an admissible input {f,g,x), the number of 
iterations K{f,g,x) during the execution satisfies K{f,g,x) = K{f o v^' ,g o ^vx) for 
any unitary transformation i/ € Z^(n + 1). 

But one can remark also that any zero Zi of U is the image of 2:1 = -^(1, . . . , 1) by a 

unitary transformation ui that leaves U invariant. Thus, K{f,U,zi) = K{f ° i^J^,U,Zi) 
for all zeros Zi of U, and ^max(C/) = Mnorm(C^, zi). 
We also obtain immediately 

_ 1 ^ _ 

U, Zi)^-Y^ K{f o t/, z,). (35) 

But for all measurable functions tp: S{T-L[d.)) ~^ R and all 1/ G U{n + 1) we have 

E </?(./)= E vif^v). 

due to the isotropy of the uniform measure on S'('H(d)). 
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Therefore, ([55]) implies 



V 



E K{f,U,z,)= E VA'(/,C/,^,)- (36) 

Proof of Theorem B. From Lemma [Ol ^norm(C^, ^i) < \/2 {n + 1)^^'^ , and thus the 
initial value for u in algorithm MDVar is less than or equal to B(C/, Zi). Therefore, the tuple 
{f,U,Zi,u) given to ALHVar during the execution of MD is an admissible input, and we 
can apply Theorem 14.31 to that execution of ALHVar. In particular, it follows that MDVar 
almost surely stops and when it does so, it returns an approximate zero of /. 

We next bound the average cost of MDVar. Recall, we denoted by K{f, U, zi) the number 
of iterations of ALHVar during the execution of MDVar with input /. Again by Theorem l4.3[ 
for any root Zj of U we have 

K{f, U, z,) < B{f,U, z,) + B{-f, U, z,). 

But we obviously have that Efes{Hi^d))^(f^^^^i) = -B(-/, ?7, 2;^), and thus 

from ((36)) . 

V 



E K{f,U,z,) < 2 E Vi3(/,f/,2,) 

/6S(«(d)) /es(«(d)) 25 

/eS(W(d)) Jo ^ r~f 



= 816I?3/2 E ds{f,U) f filiqr)dT, 
/es(w,d)) Jo 

the last line by the definition of the mean square condition number ([2]). 
Applying successively Theorem 15.11 and Lemma 15. 2[ we get 

E K{f,U,zi) < 816D^/^{8l8D^/'^N{n + l)nl^^^{U) + 0.01) 

/6SCH(d)) 

< 816i:»^/^(818 D^/^N{n + 1) • 2 + 1)^ + 0.01) 

< 667489 D^N{n + l)°+'^. (37) 

The bound for the average of costMDVar follows from the 0{N) cost of each iteration of 
ALHVar. 

We finally bound the average of the precision needed. From Theorem 14.31 (iii), the finest 
precision U, Zi) along the execution of ALHVar (and therefore, along that of MDVar ) 
satisfies, for some universal constant c, 

u*(f, U.Zi) > = . 

' - cnD-^{\ogN + D + n^)^il{f,U,Zi) 

Hence by Proposition 15. 31 

I log (/,!/, 21 ) I < \og{cnD''{\ogN + D + 'n?)^ll{f,U,Zl)) 

= 2 log (/, C7, 21 ) + \og{cnD^ (log N + D + n^)) 

< 2K{f, [7, zi) log(l + e) + 2 log /^„orm(C7, zi) + \og{cnD^{\og N + D + j 
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Using Lemma 15.21 and ([57)) , we finally obtain 



E \\ogu4f,U,Zi)\ 

/6S(«(d)) 

<2 E (A'(/,I7,2i)log(l + e) + log/i 

norm 

/6SCH(d)) 

+ \og{cnD^{\ogN + D + n^))) 
< log(l + s) ■ U3m8D^N{n + 1)-°+^ 

+ Dlog{V2{n + I)) + 2\og{cnD^{log N + D + n^)) 

We observe that the initial precision u — „ (log N+D+n^)2{n+i)^ satisfies this inequal- 
ity. 

6 Proof of Theorem A 

We assume now a fixed round-off unit u. In this context wc consider the following trivial 
variation of ALHVar: 

Algorithm ALHFix 
input {f,g,x) 

if Mf^g) > f then g := -.g 

r := 0, Qr := g 

repeat 

if u > ^B{qr,x) RETURN "Failure" 

T := min{l, t + At} 
:=t(r)/ + (l-t(r))g 

n 

x:=iV,^(x) 

until T = 1 
RETURN x 

The idea is simple: the precision in ALHFix remains constant and the algorithm proceeds 
until either it halts returning an approximate zero a; of / or it halts returning the message 
''Failure''. The latter indicates that the precision is not sufficient to guarantee the 
correct execution of the algorithm. In the former case, we say that ALHFix successfully 
halts. 

We can make ALHFix and ALHVar even closer by taking advantage of the level of general- 
ity we used to define the rounding functions r„ and the input-reading functions read_input. 
Recall, the main property of r„ is that r„(x) = x(l -I- S) with |^| < u. Similarly, we have 
made no assumptions on the functions read_input f besides the fact that they return a 
rounded-off reading of the input system / with the current precision. 
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For the rest of this section we assume that the rounding maps {r^ | m S (0, 1)} satisiy 
fu = Tu for ah u >u. We also assume that, for all h G H{d) the black-box read_input;i is 
given by read_input/ift,( ) = r„(/i) where u is the current precision. All the results shown 
in Sections |4] and [5] hold in general and, a fortiori, under these assumptions as well. In 
addition, we have the following trivial lemma. 

Lemma 6.1. Let h £ 7i(^d) andf = ru{h). If the computation of ALHfix on input (f,U, Zi) 
successfully halts then this computation coincides with the computation o/ALHVar on input 
{h,U,Zi). In particular, they perform the same number of iterations. Otherwise, both 
computations coincide until a precision finer than u is required, at which moment ALHVar 
proceeds but ALHFix halts with a failure message. □ 



Proof of Theorem A. Algorithm MDFix is what one would expect: 

Algorithm MDFix 
input f eHu 

run ALHFix on input {f,U,Zi) 

Because of Lemma 16. 1[ for / G Hu we have 

MDFix returns ' 'Failure' ' with input / ■^=^ for all h £ r-^{f), u^{h,U,Zi) < u 

where, we recall u^,{h,U, Zi) is the finest u^, required by MDVar with input h. Because of 
the equalities = /^(%^(/)) '^^ therefore have 

ProbjMDFix returns ' 'Failure' '} = Prob \u^,(h,U,Zi) <u}. 

h~Af(OJd) 

Theorem 14. Sf iii) guarantees that, for some constant A, 

1 



U,zi) > 



AnD^logN + D + n)^iiif, U, Zi) 
Therefore, 
Prob{MDFix returns ' 'Failure' '} 

-J^ti.A^"'^^^^^''^ > AunD^{logN + D + n) ) 

-h~w(o,id)l ^ ' ' 1^ 21og(l+e) 21og(l + e)i 

the latter by Proposition 15. 31 Let 

X := K[h, U, z,) + 21og(l + £) ■ 

Then X is a positive random variable and 

E X ^ OiD^N{n+l)"+^) 

h~N{OM) 
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by ([57]) and the bound /i^o„-,j(?7, Zi) < 2{n + l)^ fLemma lO)) . We can now apply Markov's 
inequality to X — i.e., Prob{X > t} < — and we finally obtain 

'D^N(n + 1)^+1 
Prob{MDFix returns "Failure"} C ^ ' 



log(l/^l) 

This shows the first assertion. To see the second, let Kf;y.{f,U,Zi) denote the number of 
iterations performed by ALHFix with input (/, U, Zi). 

Let th be the value of r when MDFix halts on input (/, U, Zi), and let 

//.(/,[/, 2:1) max (/^norm('7T, Cr)) ■ 

Te[0,TH] 

By the proof of Theorem 14.3^ 11) and Lemma [6. 1[ the number of iterations ifpixl/, U, Zi) of 
MDFix is bounded as 

K,M,U,zi) = o(^D^^^ J^\il,,,Jqr,Cr)dT^ ^o{D^/'-lil{f,U,zi)) . (38) 
Let j be such that the maximum ^J.t{f, U , Zi) is attained in the interval [tj, Tj+i]. From (j2ip . 

^il{^,U,Zl) < (l+£)Vnormfe,0)- (39) 

By Proposition 14. 5f u) and (PO)) . the precision u satisfies 

This inequality, together with (p9| . imply 

_ „i52(iogjv + D + n2)w' 
and replacing this bound in finally yields 

KpM,U,Zi) ( ^ i ). □ 
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